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Abstract 

We introduce a new characteristic of jets called mass area. It is defined so as to measure 
the susceptibility of the jet's mass to contamination from soft background. The mass area 
is a close relative of the recently introduced catchment area of jets. We define it also in two 
variants: passive and active. As a preparatory step, we generalise the results for passive and 
active areas of two-particle jets to the case where the two constituent particles have arbitrary 
transverse momenta. As a main part of our study, we use the mass area to analyse a range 
of modern jet algorithms acting on simple one and two-particle systems. We find a whole 
variety of behaviours of passive and active mass areas depending on the algorithm, relative 
hardness of particles or their separation. We also study mass areas of jets from Monte Carlo 
simulations as well as give an example of how the concept of mass area can be used to correct 
jets for contamination from pileup. Our results show that the information provided by the 
mass area can be very useful in a range of jet-based analyses. 
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1 Introduction 



In the present era of the LHC, as in the times of all precedent hadron colliders, jets remain 
fundamental objects of interest |1, 2]. Their importance extends far beyond the domain of 
physics of strong interactions, where they are used as representatives of partons participating in 
a hard process. They play also a significant role in a whole range of processes involving decays 
of heavy particles. Those include, for example, a top quark decaying into three jets, W/Z, Higgs 
boson or a hypothetical boson Z' decaying into two jets, as well as a variety of SUSY particles 
which readily decay into many-jet final states. 

A considerable effort is being made to improve our control on jets. On the theoretical side, 
this comprises, on one hand, improving the precision of calculations involving the canonical set of 
jet observables like the transverse momentum, mass or thrust. On the other hand new concepts 
are being developed including additional characteristics, like, for example, the catchment area 
of jets [3], or new analysis techniques based on subjets [4lfl4"]. 

Amongst a number of properties of a jet, its mass turns out to be important in many physical 
contexts. In the legitimate approximation of massless QCD partons, the jet mass arises due to 
its substructure. One source of this substructure is of course the radiation of gluons and quarks, 
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which leads to the well known distribution of mass of QCD jets with a significant fraction of 
jets with large masses. Consider, however, a process involving a hadronic decay of a heavy 
object of mass m. If this object, in addition, has the transverse momentum pt^>m,a. situation 
not unusual at the LHC, the decay products will end up in a single jet. The reconstructed 
mass of such jet will be an important emblem pointing to its origins. Moreover, such a fat 
jet can be analysed further with techniques involving study of the masses of its subjets. The 
jet-based reconstruction of heavy particles has been a subject of numerous studies devoted to 
decay of W Q2], WW scattering @], decay of top [BQIIGSIQI], Higgs PE] as well as SUSY 
searches 0QJQI1CE8] . 

The success of the above techniques depends crucially on the ability of precise determination 
of the mass of jets measured in experiment. In hadron colliders, however, particles that can 
contribute to the jet's substructure may also come from soft radiation unrelated to the genuine 
hard process of interest. Such radiation appears, for instance, due to independent minimum- 
bias collisions that happen in the same bunch crossing, a phenomenon known by the name of 
pileup (PU) . But even in the absence of pileup each hard process from single hadron-hadron col- 
lision is accompanied by soft underlying event (UE) which can easily modify the jet's transverse 
momentum by a few GeV |19p20j. 

A major step towards quantifying the effects of UE/PU and correct for them was made 
in [51ET], where the concept of the jet area was introduced, which is a measure of how much 
the transverse momentum a jet from a given clustering algorithm is prone to be affected by soft 
radiation. We briefly review the corresponding results in section [21 

In this paper, we introduce a related characteristic of a jet, which we will call the mass area 
and which will represent the susceptibility of a jet's mass to a soft background like UE or PU. 
In line with [3] we will introduce two types of the mass area: passive and active. The former 
will correspond to pointlike background whereas the latter will be appropriate to measure the 
susceptibility of the jet mass to the soft radiation which is diffuse and uniform. 

We will analyse the passive and active mass areas of jets from four modern clustering algo- 
rithms: kt [22H23], Cambridge/Aachen (C/A) [2IH23], anti-A* [2Z] and SISCone [2E]. The first 
three belong to the class of sequential recombination algorithms. They introduce a distance dij 
for each pair of particles and a distance diB for particle and the beam. The distances depend on 
the basic parameter, jet radius R. The algorithms start from computing the above distances for 
all final state particles. If the smallest distance involves two particles, they are recombined and 
replaced in the list of particles by the product of this recombination. If the smallest distance 
is that between a particle and the beam the particle is called a jet and removed from the list 
of entries. The procedure is repeated until there are no entries in the list. The SISCone algo- 
rithm belongs to a different class of the so called cone algorithms. They look for stable cones of 
radius R and subsequently apply the Tevatron run II procedure [22] to split or merge the over- 
lapping cones. All the above algorithms are infrared and collinear safe and are easily accessible 
via the Fast Jet package |30| I31|. Further details on each of them are given in section I3TT1 

In [3J the jet areas were calculated for the case of 1- and 2-particle systems. In the latter 
case the results were obtained in the limit of strong ordering of transverse momenta of the two 
particles. In this paper we relax the assumption of strong ordering and start by presenting 
in section [3] the corresponding general results for passive and active areas of 2-particle jets. 
Subsections 13.11 and 13.21 are quite technical. Though they contain very useful material, the 
reader interested in the main part of our study may skip them on the first reading. 

In section H] we introduce the concept of the mass area of a jet and define its passive (sub- 
section H2D and active (subsection 14. 3p variants. There, we also analyse their properties for the 
system of 1- and 2-particles. In particular, we compare results from the four algorithms and 
examine the dependence on the relative hardness of the constituents of 2-particle jets. At the 
end of each subsection we discuss the problem of logarithmic dependence of the mass area of 
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QCD jets on the jet's transverse momentum. We give it a quantitative description in terms 
of the anomalous dimension and compare the results across the jet algorithms. Throughout 
the paper we work in the small R approximation which is justified by the observation that the 
corrections from higher powers of R are accompanied by small coefficients |19p 32j . 

In section we turn to a study of jets simulated with Pythia. We illustrate how the features 
found for simple 1- and 2-particle systems help understanding mass areas of more realistic jets 
(subsection 15 . 1 [) . Then, we give an example of practical application of mass areas to correct jet 
mass for the contamination from pileup (subsection I5.2p . Finally, we summarize our results in 
section [6] and provide some extra details in two appendices E] and |Bj 

2 Essential definitions, notation and brief review of jet areas 

2.1 Passive area 

Consider a set of particles {pi} which are clustered with an infrared safe jet algorithm into 
a set of jets {</«}. Suppose now that we add to the set {pi} a single infinitely soft particle g, 
which hereafter we shall call the ghost, and repeat the clustering on the new set of particles 
{Piid}- Because we use an algorithm which is infrared safe and because our extra particle g has 
infinitely small transverse momentum this clustering will not change the set of jets {J{\- The 
ghost particle g can be either clustered with one of the real particles, in which case it ends up 
in one of the jets Jj, or it can form a new jet with g being its only constituent. 

The passive, scalai0 area of the jet J is defined [3] as the area of the region in the (y, <fi) 
plane in which the ghost particle g is clustered with J 

i T \ f 7 ii n i l\ t\ £ f T \ f 1 for 5 clustered with J . . 

a(J) = J dyd<f>f( 9M ,J), J) = | Q for I ^ dugtered w . th j . (2.1) 

Such definition provides a measure of the susceptibility of the jet to soft radiation in the limit 
in which this radiation is pointlike. 

For a set o particles that consists only of a single particle p\ the passive area of the cor- 
responding jet J\ is a(Ji) = nR 2 for all four jet clustering algorithms: kt, C/A, anti-/c t and 
SISCone. 

Adding a second particle p2 leads to the result which depends on the jet definition (i.e. jet 
algorithm and jet radius) and the geometrical distance between particles p\ and p2 in the (y, 4>) 
plane A 2 2 = (yi — y2) 2 + ((pi — 4>2) 2 - The analytic results for a(Ai2) of the harder jet in the limit 
Pa S> pt2 S> Aqcd S> pt g for all four algorithms were obtained in [3]I27|. In Fig. Q] (left) we show 
the corresponding functions, normalised to the 1-particle passive area. We notice substantial 
dependence on the algorithm especially in the region A12 < R where the two particles form a 
single jet. There, the areas from the kt and C/A algorithms are notably different from ttR 2 
and vary significantly with the distance between the particles. On the contrary the areas from 
SISCone and anti-kt are identical with the 1-particle area for A12 < R and in the latter case 
also for A12 > R. All results recover the correct limit of ttR 2 when A12 goes either to or 
to 2R. 

2.2 Active area 

Suppose that we add to the set of particles {pi} not a single ghost like in the case of passive 
area but a dense coverage of ghost particles randomly distributed in the (y, 4>) plane. Again, 

x The 4- vector passive area was also defined in [3]. Though we will not use it directly in the current study, we 
note as an aside that the concept of 4- vector passive area is implicitly present in the calculations of passive mass 
area of section l4~2l In particular, all the results from that section could be alternatively obtained with a direct 
use of 4-vector passive area. 
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Figure 1: Passive (left) and active (right) area of the hardest jet in a 2-particle event with 
Pt2 "C Pti and the interparticle separation A 12. All curves for passive areas as well as the anti-fct 
and SISCone curves for active area represent the analytic formulae obtained in [5J[27]. The 
active area results for the kt and C/A algorithms were computed using the Fast Jet 2.4.2 
package 



the original jets { Jj} are not modified, but, they can contain many ghosts which are clustered 
together with real particles. In addition, now ghost may also cluster among themselves leading 
to formation of jets with no physical particle - the pure ghost jets. 

The active scalar area of a jet J is defined [3] as a number of ghosts contained in this jet per 
the density of ghosts per unit area averaged over many sets of ghosts. If the number of ghosts 
from a particular ghosts ensemble {gi} clustered with the jet J is A/{ 5i }(J) and the number of 
ghosts from this ensemble per unit area is V{ gi \ then the active scalar area is given by 

A(J)= lim (A(J\{ 9i })) A(J\{ 9i }) = M{!h}iJ) , (2.2) 

where in addition to the limit of the infinite density of ghosts, the average over many sets of 
ghosts is taken. The latter is necessary since the ratio A/{ 9i }(J) depends on the particular 
set of ghosts even in the limit of high ^{ gi }- Therefore, one also defines the standard deviation 
of the distribution for the active area over many ghosts ensembles 

X 2 (J) = lim (A 2 (J\{g t })) -A\J). (2.3) 

\Jg — ?oo y 

The active area is meant to measure the susceptibility of a jet to the soft radiation which is 
uniform and whose density is high. 

Similarly to the scalar area also the 4-vector active area may be defined as 

A ^ J) ~ .jToo {A ^ J 1 {9i})) 9 > M J I id*}) = E > ( 2 - 4 ) 

where (ptg) is the average ghost transverse momentum. The 4-vector area will prove useful 
in section 14.31 were shall discuss the active mass area. For small jets, the scalar area and 
the transverse component of the 4-vector area are virtually equal, A(J) ~ At(J), and A^ is 
a massless vector which points in the direction of the jet. For larger, jets the 4-vector area 
becomes massive and its direction differs from that of the jet. 
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algorithm A/{ttR 2 ) 



S/(7Ti? 2 ) 



1-particle-jet ghost-jet 1-particle-jet ghost-jet 



h 

C/A 
SISCone 
anti-fe 



0.812 0.554 0.277 0.174 

0.814 0.551 0.261 0.176 

1/4 

1 



Table 1: Summary of the results from [3l[27] for active areas and their fluctuations in the case 
of 1-particle and pure ghost jets. The numbers for the kt and C/A algorithms where obtained 
from numerical study with Fast Jet [30|J31| whereas those for anti-kt and SISCone represent exact 
values from analytic calculations. All results are normalised to ttR 2 . The results for pure ghost 
jet areas are not shown for SISCone and anti-/cj. In the first case they depend strongly on the 
spilt-merge parameter, /, while in the second case the distribution has two peaks at and irR 2 . 

The active area can be studied numerically for any infrared safe jet clustering algorithm, 
most easily using the Fast Jet package [3U1ET]. In addition, the analytic results can be obtained 
in some cases for the anti-fcj algorithm and for SISCone. 

Unlike the passive area, the active area of the 1-particle jet may differ significantly from 
the naive expectation of ttR 2 . Firstly, in that it is in general a rather broad distribution over 
many ghost ensembles and secondly in that the average value of this this distribution may lay 
below ttR 2 . This is illustrated in table [IJ which summarises the results for the average active 
scalar areas of 1-particle and pure ghost jets and the corresponding standard deviations from 
four clustering algorithms obtained in [3JE7]. We see that the average values for the kt and 
C/A algorithms are significantly lower than ttR 2 with pure ghost jets having smaller jet area 
than the jets with 1 hard particle. Moreover, the values of standard deviations indicate that 
the distribution of active jet areas is rather broad. The anti-fct algorithm is special in that 
its 1-particle-jet active area is equal to the passive area ttR 2 and does not fluctuate [57] . For 
the SISCone algorithm, the active area of a single-particle jet can be calculated exactly [3] and 
it turns out that its value is four time smaller than that of passive area. The active areas of 
ghost jets for SISCone and anti-fcj exhibit somewhat more complex behaviour. For the former 
the results depend on the split-merge parameter, /, and for the latter the distribution has two 
peaks at and ttR 2 . That is why we do not show them in table [TJ 

As in the case of passive area, discussed in the preceding subsection, also here, adding a 
second particle to the system has a significant effect on the active area of the hardest jet. This 
is illustrated in Fig. [1] (right) for the case of pt2 <C Pn, which was considered in |3j. As we 
see, the behaviour depends significantly on the algorithm. The active areas from kt and C/A 
exhibit similar shape to the passive areas differing with the latter mostly by about 20% lower 
normalisation. The anti-fct, as expected, gives the same result for passive and active area. The 
most drastic change is seen for the SISCone algorithm for which the active area is almost factor 
four smaller than the passive one (c.f. table [I]). 

3 Areas for general case of 2-particle system 

In the current study we are interested in mass of a jet and the way it is affected by soft 
background. The main contribution to jet mass comes from its substructure. This substructure 
may originate, e.g., from QCD splittings. In this case, the results for the areas of jets consisting 
of two strongly ordered particles, obtained in [3] and reviewed briefly in the previous section, 
are adequate. However, if the two constituents of our simple jet come from a decay, then their 
transverse momenta are comparable and one expects such a jet to have different properties. 
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In order to be able to discuss the problems related to jet masses for the whole spectrum of 
cases between those two extremes of pt2 <C pn (QCD jets) and pt2 ~ Pti (jets from decay), as a 
preparatory step, we will generalise the results for jet areas from [3] to the case of two particles 
with arbitrary transverse momenta. 

It will be convenient to quantify the relative hardness of the two particles, p\ and P2, in 
terms of the variable 

minfp+i, Pfo) 

z = yF ,F ' , (3.1 

m + Pt2 

which, by definition, is always in the range < z < 1/2. 

The main difference with respect to the case discussed in the previous section will be that 
now, when the particles p\ and pi are combined, the jet J\2 may be centred anywhere between 
the positions of these two particles. Before, such jet was centred always at the harder particle. 
The exact values of (yj 12 , (f>j i2 ) will depend on the recombination scheme used as part of a jet 
definition. Out of several existing schemes, we adopt for the study presented in this paper the 
widespread E'-scheme which combines particles by simply adding their 4-momenta. Apart from 
being very intuitive and preserving Lorentz symmetry, it has been also recommended in |29j. 

For the 2-particle system, the centre of the jet will lie on the line segment bounded by the 
positions of the particles. Therefore, the results for mass areas from the four algorithms that 
we are going to study can depend only on the distance along this line, from one (any) of the 
particles to the centre of the jet. We will denote the distance from the softer particle by Aj. It 
will depend on A 12 and on the asymmetry parameter z. 

For convenience we also introduce the versions of A12 and Aj normalised to the jet radius 

xj = —. (3.2) 



R ' R 

Employing the above definitions, we may write the explicit formula for xj in the i?-scheme valid 
in the small R limit 

xj(x, z) oi (1 — z) x . (3.3) 

One notices that, since, according to the definition (|3,ip . z < 1/2, the softer particle is always 
further away from the centre of the jet than the harder one. The distance between the latter 
and the jet's centre being x — xj. In the limit of strongly ordered particle transverse momenta, 
xj — > x and the jet gets centred at the harder of the two particles. 



3.1 Passive areas 

Since the system under consideration is simple and the hardest jet may consist of at most two 
particles, its passive area can be calculated analytically. The result will depend on the order of 
clustering of particles p\, P2 and the ghost g. This order is different for each algorithm hence 
the passive areas will vary across them. As mentioned in the introduction, we work in the small 
R approximation. One consequence of that is that we treat the directions y and <j) in the (y, 4>) 
plane on equal footing. 



The kt algorithm with its 2-particle distance measure, dij = min(p|j,p|)A|/i? 2 , and beam- 
particle measure, diB = Pu, will always cluster the ghost first with either one of the particles 
Pi, P2 or the beam. In the latter case, the contribution to the area is zero. In the former 
case, the ghost clusters with the particle which is geometrically closer according to the distance 
Aj g regardless of the relative hardness of particles p\ and p2- Therefore, the result will be 
independent of z and will coincide with that found in [3] and shown already in Fig. [T] (left) of 
section |2~T1 The corresponding formula can be found in the appendix lAl 
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Figure 2: Representation of the passive area of the hardest jet in the system with two particles 
having arbitrary transverse momenta for various algorithms and interparticle separations. The 
dots represent the particles and the cross centre of a jet. The distances x and xj are defined 
in Eq. (|3.2|) and the critical values of x in Eqs. (|3.6p . (|3.7p . (|3.9p and (|3.13p . The asymmetry 
parameter z is given in Eq. (|3,ip . 



The Cambridge/ Aachen algorithm does not take into account the hardness of the particles 
undergoing the clustering but solely the geometric distance Ajj between them according to the 
measures dij = A?./i2 2 and diB = 1. The clustering of the system of two particles p\ and p2 
and the ghost proceeds as follows. If the ghost is closer than A12 to either of the perturbative 
particles then it is clustered first with the closer one. Subsequently, the particles p\ and P2 are 
clustered. If, however, the distance between the ghost and the closer particle is greater than 
A12 then the two perturbative particles are clustered first forming the jet Jyi centred at the 
point in the line segment between the positions of the particles p\ and P2 at the distance Aj 
from the softer particle. Then, the ghost may cluster with J12 if its distance to the jet's centre 
is smaller than R. Therefore, the area of a 2-particle jet in the C/A algorithm is a union of two 
smaller circles of radius A12, centred respectively at the particles p\ and P2 and the big circle 
with radius R centred at the jet J\2- 

The range < x < 2 consists of four distinct sub-ranges. The two critical values of x, which 
we denote as x c \ and x C 2 , correspond to the situations where one or two of the small circles start 
sticking out of the big circle, as depicted in Fig. [2](a) and (b). For x below x c \ or above 1 the 
results will not depend on the asymmetry parameter, z, and will be identical with those found 

The conditions for the critical values of x are given by 

Xd + xj = 1 , (3.4) 
2x c2 -xj = 1. (3.5) 
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Figure 3: Passive areas of the hardest jet in the system of two particles with arbitrary ratio of 
transverse momenta, as functions of the interparticle separation, x, for C/A (left) and SISCone 
(right). The parameter z is defined in Eq. (|3.ip . The value z = corresponds to strongly 
ordered transverse momenta of the two particles and the value z = 0.5 to the system of two 
particles of equal hardness. The SISCone result does not depend on the value of / parameter. 



In the limit of small R the approximate solutions have the following simple forms 

X c l(z) 

x c2 {z) 



2-z' 
1 

l + z ' 



(3.6) 
(3.7) 



The analytic result for the passive area from the C/A algorithm is given in appendix The 
corresponding curves are shown in Fig. [3] (left) for R = 0.6 and several values of the asymmetry 
parameters z. We note that the dependence on z is mild. In the limit z — )■ 0, x c \ — > 1/2 
and x C 2 — > 1, and one recovers the result for the system of two particles with strongly ordered 
transverse momenta from [3]. 



The SISCone algorithm looks for stable cones of radius R, which are the cones whose 
direction coincides with the I?-scheme sum of the momenta of the particles inside. Those cones 
which overlap are subsequently split or merged according to the Tevatron run II type [29J 
procedure. This procedure starts from ordering stable cones according to the scalar sum of the 
transverse momenta of their constituents, pt- Then, the pt shared between the hardest jet and 
the next to hardest jet that overlaps with it (with ptj) is compared with fptj, where / is the 
overlap threshold parameter. The cones are merged if pt > fptj and split otherwise. 

For x < 1 only one stable cone is found, with its centre between the particles p\ and p2- 
Any ghost within this cone belongs to the jet. Therefore the area is identical to that of a single 
particle jet. 

For 1 < x < 2 two stable cones are always found, centred at particles p\ and p2 respectively. 
In addition, for 1 < x < x C 4 a third stable cone is found containing both particles. The third 
cone is stable as long as the distance between the jet's centre and the softer particle is smaller 
than R. This gives the condition for x C 4 

xj{x c4: , z, R) = 1 , (3.8) 



8 



which in the limit small R leads to 

and since, according to the definition (|3.1|) . < z < 1/2, the above critical value stays in the 
range 1 < x C 4 < 2. As a next step, one has to check if the overlapping cones have a chance to 
be merged. As shown in Fig. [2(c), all the three cones overlap in the region 1 < x < x c ^. The 
central cone has the largest pt and the amount of pt shared with the left jet is pn since the two 
jets have only one common particle. The condition for merging the left and the central cone is 
Pti > fpti and it is always satisfied. Similarly, the right cone will always be merged with the 
middle cone. 

Therefore, in the region 1 < x < x C 4 the jet area will be given by the area of the union of the 
three circles, depicted in Fig. [2](c). In the region x C 4 < x < 2, only two stable cones are found 
with no common particle so they are never merged. The two particles will end up in different 
jets. The area of the harder one will be the same as for the kt and C/A algorithms in this range 
of x. 

The final formula for the passive area in the SISCone algorithm is given in appendix [A] The 
corresponding curves are shown in Fig. [3] (right) for R = 0.6 and four values of z. Contrary to 
the kt and C/A algorithms, here the dependence on the asymmetry parameter, z, is very strong 
for x > 1. We note that the average area in this region is bigger by the factor of around two 
for jets consisting of two subjets with comparable pt with respect to the jets whose constituents 
are strongly ordered in transverse momenta. As expected, in the limit z — > one recovers the 
result from the Fig. Q] (left) since x C 4 —> 1 and the third stable cone cannot exist for any value 
of x. 



The anti-fct algorithm is a sequential recombination algorithm with hierarchy inverted with 
respect to the ^-algorithm by using the measures dij = min(p^ 2 ,p^ 2 )A? ; /i? 2 and diB = p^ 2 ■ 
The hardest particle in a system will cluster first with anything within the geometric distance 
A < R. In the event with two particles of arbitrary transverse momenta and a ghost the three 
competing distances are A\ g , A2 g and A12. 

For < x < 1, the events in which the distance between the ghost and one of the physical 
particles is the smallest lead to formation of two small circles around particles p\ and P2 as 
depicted in Figs. [2](d) and (e). If, however, A12 < Ai s , A2 9 , then the real particles are clustered 
first, leading to the jet J12, which subsequently clusters with the ghost provided that the distance 
between the two is smaller than R. Up to a certain value of x, which we denote as x C 2, the two 
small circles are contained in the big circle of radius R and the area is simply that of 1-particle 
jet, i.e. 7ri? 2 . Above x C 2 the circle centred at the harder of the two particles protrudes and one 
gets the configuration shown in Fig. [2](d). Then, above x c s, the second of the two small circles 
starts sticking out leading to the jet depicted in Fig. 0(e). 

The conditions for the aforementioned critical values of x are given by 

2x c2 -xj = 1, (3.10) 

X C 3+XJ = 1. (3.11) 



l-z 

The approximate solutions to each of these equations can found for R < 1 

x c2 (z) ~ — |— , (3.12) 
1 — z 

Xcs(z) ^ —3. (3.13) 

1 — z + Z A 

For 1 < x < 2 the particles p\ and P2 form two separate jets. However, the shape of the 
jet centred at the harder of the two particles will not be entirely conical. The presence of the 
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Figure 4: Passive jet areas for the system of two particles with arbitrary ratio of values of trans- 
verse momenta from the anti-kt algorithm. The asymmetry parameter z is defined in Eq. (|3.ip 
and x is the distance between the two particles in the units of R. 



second particle will cause it to be clipped. This situation is shown in Fig. [5](f). The boundary b 
between the jets J\ and J2 is defined by zAif, = (1 — z)/±.2b- Hence, it turns out that the area of 
the jet J\ will be reduced with respect to ttR 2 by the area of the overlap region of the circle of 
radius R around that jet and the circle of radius ^\_2^ A and the centre away by A from 

the centre of the jet J\. Above x = 1/(1 — z) the two circles do not overlap and the area of the 
harder jet becomes perfectly conical. 

In Fig. HI we show the curves corresponding to the analytic results for the passive area from 
the anti-kt algorithm, which can be found in appendix [XJ One notices that, in general, the 
anti-kt jets are not perfectly conical. If a jet consists of two particles of comparable hardness 
separated by A 12 ~ R its area deviates from irR 2 , the more so the closer to each other are the 
transverse momenta of the two constituents. On the other hand, if the separation between two 
particles is smaller than 1/(1 + 2;) or greater than 1/(1 — z) or if their transverse momenta are 
strongly ordered the resulting area of harder jet is equal to that of a single particle jet. For 
the maximally symmetric system, corresponding to, z = 0.5, the anti-/cj result for passive area 
coincides with that from the C/A algorithm (cf. formulae from appendix [A]). However the two 
algorithms behave very different for z < 0.5. 

3.2 Active areas 

Since the computation of the active area involves clustering of a very complex system with a 
large number of ghost particles, in general, one needs to rely on the numerical analysis. As 
was the case for the passive areas also the active area results are expected to vary between the 
algorithms. This is because each algorithm comes with a different order of clustering of real 
particles and ghosts and it is this order that governs the behaviour of jet areas. 

The numerical analysis has a potential to produce slightly different results for jets oriented 
along the y or <p axis. This is because it operates on the real phase space for which these directions 
are not equivalent. One expects, however, that for the values of R which are sufficiently small, 
the corresponding differences should be largely subdominant. In practice, all the results shown 
in this and the following sections correspond to jets with the two constituent particles aligned 
along the rapidity axis. We have checked explicitly that the opposite extreme case of the particles 
oriented along the <p axis gives virtually the same results for the jet areas. The situation for 
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Figure 5: Active areas of the hardest jet for the system of two particles separated by the distance 
xR in the (y, 4>) plane and having arbitrary transverse momenta. The plots correspond to the kt 
(top left), C/A (top right), SISCone (bottom left) and anti-fct (bottom right) algorithms. The 
asymmetry parameter, z, is defined in Eq. (|3. 1 j) . All the results obtained with Fast Jet [30p31j. 



the mass areas discussed in section [J] is similar except for certain configurations studied with 
SISCone algorithms which we will comment on in due course. 

We have studied the active areas of the hardest jet in the system of two particles of arbitrary 
relative hardness. We performed analyses with the same four jet definitions as discussed in the 
previous subsection, i.e. kt, C/A, SISCone and anti-fcj together with the S-scheme for particle 
recombination. The results are presented in Fig. [5l The overall picture is very similar to that 
from the study of passive areas of preceding subsection. 

The "non-conical algorithms", kt and C/A, exhibit either no dependence on z, in the case 
of kt, or only a weak z-dependence in the case of C/A. The active areas from both algorithms 
behave very similarly to their passive area counterparts. For x < 1, where the two particles 
form a single jet, the kt active area grows steadily whereas the C/A active area stay practically 
constant for low x and starts growing rapidly above certain value of x. The pattern of mild z- 
dependence in the latter region is also the same as that seen for passive area, i.e. the results are 
slightly smaller for more symmetric system of two particles. For x > 1, the hardest jet consists 
of a single particle and its active area from k t and C/A does not depend on the asymmetry 
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parameter z. Apart from the similarity to the passive area results, the 2-particle active areas 
are smaller by about 20%. This has already been observed for the 1-particle jets and the 2- 
particle jets with strong p^-ordering in [3 J and we have also recalled those results in table [T] and 

Fig.m 

The SISCone algorithm gives the active area which depends quite strongly on z. We see 
that, for z ~ 0.5, it stays well above the 1-particle result, ttR 2 /4, also for x > 1 and then 
it drops at some value, just as was the case with passive area. Again, this is related to the 
existence of the third stable cone containing both particles p\ and p2 ■ Below a critical value of 
x, the same as that given in Eq. (|3.9p . this third stable cone is being merged leading to large 
jets. However, there is also a difference between the cases of passive and active areas for large 
z and x > 1, namely in that the active area falls with x for 1 < x < x C 4 whereas the passive 
one keeps growing in this region. The mechanism responsible for this effect is the same as that 
which leads to the reduction of the 1-particle active SISCone area by the factor 1/4 with respect 
to the passive area of 1-particle jet as explained in [3]. It is related to additional splittings of 
stable cones with physical particles which overlap with stable cones built up solely of ghosts. 
Such splittings involving the central stable cone from Fig. [2](c) lead to narrowing the jet with 
increasing x. This may lead to the active area of a jet containing two particles being smaller 
than the active area of a 1-particle jet. As shown in Fig. [5] (bottom left) such situation indeed 
happens for the system with z close to its maximal value 1/2 (identical transverse momenta of 
the particles pi and P2). In this case, the critical value x C 4 is reached for very high x (or never 
in the case of z = 1/2) and the 2-particle active area can smoothly decrease below the 1-particle 
result. 

The anti-fci active area results shown in Fig. (bottom right) are identical to the passive 
areas from Fig. [H This comes from the fact that the ghost particles cluster among themselves 
only after all clusterings involving perturbative particles. The equivalence of the passive and 
active areas from anti-A;t for the 2-particle jets with strongly ordered transverse momenta of the 
two constituents, corresponding to z = 0, has been pointed out in [27]. Their equivalence for 
arbitrary z, illustrated in Figs. 2] and [5] (bottom right) is also known and has been taken into 
account in the Fast Jet program (see the code accessible in [31] )• As in the case of SISCone, also 
for the anti-fcj algorithm there is a region of strong z-dependence. 

For all the algorithms and all the z values shown in Fig. [5l the 2-particle active areas tend 
to the 1-particle results in the limit x — > 0. However, in the limit x — > 2 the results converge 
to the 1-particle area only for the "conical algorithms", i.e. SISCone and anti-Zc^. For kt and 
C/A the 2-particle jet areas are different from the values given in table [T] even if the separation 
x > 2. This is related to the fact that these algorithms build up the jets starting from formation 
of local structures which are subsequently merged leading to jets of very irregular areas. 

4 Mass area 
4.1 Jet mass 

The mass of a light quark jet arises due to its substructure. If a jet Jyi is obtained from clustering 
two subjets Ji and Ji with masses much smaller than their transverse momenta, mj 1 2 <C ptj 1 2 , 
then the mass of the jet J\2 in the small R limit is given by 

m Ji2 - m \ + m J 2 +PtJiPt.h^\2 (4- 1 ) 
= rn 2 Jl +m 2 j 2 +z(l-z) Pt j l2 Al 2 , (4.2) 

with Af 2 = (yji - yj 2 ) 2 + (0j x - <pj 2 ) 2 and z defined in Eq. (HQ}- 

Jet mass is an infrared and collinear safe quantity that can be calculated order by order in 
perturbation theory. Because of the soft and collinear singularity of the QCD matrix element 
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for gluon emission, the distribution of masses of the QCD jets gets strong enhancement for 
low values of mj. At the lowest non-trivial order (i.e. NLO of the perturbative a s expansion) 
the approximate result for the mass distributions of QCD jets is given by [UEJIT7] = 



Oi s (ptj)r^~ l n ( ) ' wnere C is the colour factor of the initiating parton. The higher order 



terms are enhanced by further powers of In ^ . The resummed corrections are known for jets 
from e + e _ [33H35] and DIS [36J. Contrary to the case of QCD, the distribution of jets coming 
from decay of a heavy object is flat in z and therefore the mass distribution of such jets is peaked 
around the mass of the heavy object which originated them. 

As discussed in the preceding sections, the area of a jet provides a measure of the suscep- 
tibility of the jet's momentum to soft background. Such a measure, combined with a method 
of determination of the level of this background, like the one discussed in (201.121] . allows one to 
account for the contamination from UE/PU and correct the momentum of the jet accordingly. 

Similarly, one can define a quantity which measures how much the mass of a jet can be 
modified by the soft radiation for jets defined with a given algorithm. In what follows, we 
define such a new characteristic of jets, which we call the mass area, and use it to study 1- and 
2-particle jets from the four jet-clustering algorithms. 

4.2 Passive mass area 

In analogy to the passive jet area from section I2.lt the passive mass area of the jet J can be 
defined as 



where mj is the mass of the jet J, mj 5 is the mass of a jet that consists of the jet J and 
the ghost g and pt g is the transverse momentum of that ghost. The passive area defined in 
the above equation is dimensionless. Its value reflects susceptibility of the mass of a jet to 
the contamination from soft radiation in the limit in which this radiation is infinitely soft and 
pointlike. 

For a jet consisting only of a single hard particle with transverse momentum pn, the passive 
mass area for all the four algorithms is given by 



The above result coincides with the polar moment of inertia of a disk (or cylinder) of radius R. 
This correspondence is general and, in fact, the passive mass area defined in Eq. (|4.3p is nothing 
but the polar moment of inertia, i.e. the measure of resistance of an object to torsion. This 
resistance is small if the mass is distributed close to the rotation axis (here, the jet centre) and 
large if the mass extends far away from the rotation axis. 

4.2.1 Passive mass areas for general case of 2-particle system 

The calculation of the passive mass areas for the system with two particles of arbitrary z proceeds 
in close analogy with the calculation of passive areas for that system which led to the results 
presented in section 13.11 In particular, all the subranges of the separation variable x and the 
corresponding pictures from Fig. [5] are valid also for passive mass areas. However, now the 
integrand in the definition given of Eq. (|4.3p is less trivial. 

The total squared mass of the system composed of two massless perturbative particles and 
the ghost g is given by 






(4.4) 



m 



12g - PtlPt2&l 2 + PtlPtg^lg + Pt2Ptg&l g ■ 



(4.5) 
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Figure 6: Passive mass areas of the hardest jet for the system of two particles with arbitrary ratio 
of transverse momenta in the case of the kt (left) and C/A (right) algorithms. The parameter 
z is defined in Eq. (|3.ip and x is the interparticle distance in units of R. The value z = 
corresponds to strongly ordered transverse momenta of the two particles and the value z = 0.5 
to the system of two particles of equal hardness. 



If the two particles p\ and p2 come from the soft QCD splitting the last term is negligible. If, 
however, they come from a decay of a heavy object, the last two terms are commensurate. 

Plugging the above expression (|4.5p into the definition (|4.3p allows one to obtain analytic 
results for passive mass areas from all four algorithms. We give the corresponding formulae in 
appendix[Aj Below, we comment on the results for each of the four algorithms, which are shown 
in Figs. E] and [3 

The kt algorithm produces jets whose passive mass areas do not depend on the relative 
hardness of the two constituent particles. This occurs in spite of the fact that the integrand in 
the definition (|4.3p with mj g taken from Eq. (|4.5p does depend on z. However, because the kt 
algorithm always clusters the ghost first with one of the particles pi and p2, the shape of the 
jet in the (y, 4>) plane has an additional reflection symmetry. This, in turn, implies that the 
integrated contributions from each particle differ only by the multiplicative factor, 1 — z for p\ 
and z for p2- Therefore, the z-dependence cancels in the sum. As shown in Fig. [6] (left), the 
qualitative behaviour of the mass area is the same as that of the area from Fig. [1] (left). As long 
as the separation between the particles x < 1 the passive mass area grows fast with increasing x. 
Quantitatively, however, the change of the passive mass area of the 2-particle jet with respect 
to the 1-particle jet is much bigger than the corresponding change for the passive jet area. As 
we see by comparing the results from Figs. [T] (left) and [6] (left), the former changes by the factor 
~ 3.6 in the range x < < 1 while the latter only by the factor ~ 1.6. For x > 1 the hardest jet 
consists solely of a single particle. However, the presence of the second jet in the neighbourhood 
causes its mass area to be slightly smaller than ttR 4 /2. The value of a single particle mass area 
is being slowly approached as we go to x = 2. 

The Cambridge/ Aachen algorithm gives jets with mass areas weakly dependent on the 
asymmetry parameter z as is depicted in Fig. [6] (right). As in the kt algorithm, also here the 
overall shape of the mass area as a function of the distance between constituent particles p\ and 
P2 is very similar to the shape found for the passive area (cf. Fig. [3]). The quantitative change 
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Figure 7: Passive mass areas of the hardest jet for the system of two particles with arbitrary 
ratio of transverse momenta in the case of SISCone (left) and anti-A^ (right). The parameter z 
is defined in Eq. (|3.ip . The value z = corresponds to strongly ordered transverse momenta of 
the two particles and the value z = 0.5 to the system of two particles of equal hardness. The 
SISCone result does not depend on the value of / parameter. 

is, however, again much bigger for the mass area whose 1-particle value (|4.4p can be modified 
up to the factor ~ 3.6 by the presence of the second particle (comparing to the corresponding 
factor of ~ 1.6 for the area). As can be found by inspecting Fig. [6] (right) or the corresponding 
formulae from the appendix [XJ there is also a small qualitative difference between passive area 
and passive mass area in the behaviour for < x < x c \. The mass area starts growing with x 
right from the beginning contrary to the area which is constant for x < x c \. 

The SISCone algorithm returns jets with mass area strongly dependent on the separation x 
between two constituent particles. For x < 1 the mass area differs from the 1-particle result 
only mildly, growing slightly with x (unlike the area which is constant in this region, cf. Fig. [3j). 
For x > 1, however, the mass area of 2-particle SISCone jets jumps by the factor of four and 
continues growing very fast with x reaching the value ~ 11 for the 2-particle system with z = 1/2 
and the separation x = 2. We have seen already a similar behaviour for the areas of SISCone 
jets shown in Fig. [3] (right) but it is much bigger in quantitative terms for the mass area. The 
cause of the big change of the mass area at x = 1 is the same here, namely for two particles with 
comparable transverse momenta there is a region of x where there are three stable cones which 
all get merged leading to a gigantic jet. One can exploit this property in two different ways. 
If one is interested just in measuring the jet mass with an algorithm which is as little sensitive 
to the soft pointlike radiation as possible than, clearly, the result from Fig. [7J (left) strongly 
disfavours SISCone. This is especially true if the jet comes from decay of a heavy object in 
which case its subjects have similar hardness and the separation x may be easily greater than 1. 
The result from Fig. [7J (left) could alternatively be regarded as a useful additional characteristic 
of a jet. It could be used to devise some discriminating variable which would help separating 
QCD jets, which have small mass area, from the jets coming from a heavy object decay, which 
exhibit significantly larger passive mass area. 

The anti-fct algorithm produces jets with mass area growing slowly with x up to the critical 
value x C 2 from Eq. f)3. 12|) . Between x C 2 and 1 the growth becomes much faster and the more so 
the closer to each other are the values of transverse momenta of the two constituent particles. 
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For x > 1 the hardest jet consist of a single particle and its mass area slowly approaches ttR 4 /2 
with increasing x. Hence, qualitatively, the behaviour is not very different from that seen in 
Fig. [J] for the passive area except the region below x C 2, where there was no growth in the latter 
case. But as for the three algorithms discussed above, also for anti-fc^ the quantitative effect of 
adding a second particle is much bigger for the mass area than for the area of a jet. Overall, 
the passive mass area from the anti-fct algorithm may be substantial especially for symmetric 
configurations (z ~ 0.5) with interparticle separation x ~ 1. One notices also that the anti-fct 
result for z = 0.5 coincides with that from the C/A algorithm for the same z value. This is 
reflected as well in the exact formulae given in appendix [Aj However, as we go away from 
z = 0.5 the two algorithms behave very different as seen from Figs. [6] (right) and [7] (right). 



4.2.2 Scaling violation of passive mass area of QCD jets 

Mass area is sensitive to substructure of a jet. For the QCD jets this substructure arises due 
to radiative emissions of gluons. Therefore, we expect that the average mass area of a QCD 
jet will acquire logarithmic dependence on jet's transverse momentum. The coefficient in front 
of this logarithm, which we will call anomalous dimension, can be easily found in the small R 
approximation. The results for jet areas were obtained in [3]. Here, we will determine their 
passive mass area counterparts. 

The mean mass area at the order a s for a given jet algorithm and with a given R value can 
be written as 

W = a m (0) + (Aa m ) = |i? 4 + (Aa m ) . (4.6) 

The 0{a s ) correction in the limit of strongly ordered transverse momenta of the particles, 
Pt2 "C pa, adequate for QCD jets, is given by 



f 2R f Ptl dP 

(Ao m )~ / dA 12 / dp t <2-, -r— (a m (Ai 2 ) -a m (0)), (4.7) 

Jo JQo/A 12 dp t 2dA 12 

dP 



with dpt 2dA 12 b em § the probability for emitting a gluon with transverse momentum^ at relative 
angular distance A12 and the second term in the bracket accounting for virtual corrections. The 
lower limit of the integration over pt2 contains a cut-off Qo for the relative transverse momentum 
of the particle p 2 with respect to particle p±. The need for such a cut-off comes from the fact 
that the mass area, just like the area of jets, is not an infrared safe quantity and its value 
depends on non-perturbative effectsU The convergence of the integral over A12 is guaranteed 
by the property that the passive mass area of the hardest jet in the 2-particles system tends to 
the 1-particle result both when A12 — > and A12 —> 2R. Taking the QCD matrix element in 
the soft and collinear approximation 



dP 2d a s (p t2 A 12 ) 



dp t 2dA 12 vr Ai 2 pt2 
and performing the integration in Eq. (|4.7[) . one finds 



(4.8) 



/a \ j 2a s Ci Rpa , . Ci a s (Q ) 

(Aa m ) = d m In——, (Aa m ) = d m — -In — — - — -, (4.9) 

7r Q 7r&o a s (Rpti) 

in the fixed and in the running coupling approximation, respectively. In the latter case A12 
was replaced by R in the argument of the coupling which affects only the terms not enhanced 
by the logarithm of R. Ci is a colour factor corresponding to the parent particle and 60 = 
(llC A -2n / )/(12vr). 



2 As argued in [3], events with pile-up provide a natural infra-red cut-off which replaces Qo- 
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Table 2: Coefficients governing the logarithmic scaling violation of passive mass areas with 
transverse momentum of a jet for 2-particle QCD jets. The analytic results are normalised to 
i? 4 . We use the shortcut notation for £ = (^'(1/6) + ^'(1/3) - ^'(2/3) - ^>'( 5 / 6 )) /(48>/3) - 
0.507471 where ^'(x) is the trigamma function. In the results for s m , £(3) ^ 1.202 is a special 
value of the Riemann zeta function. The numerical results are normalised to the passive mass 
area of a 1-particle jet. 

The coefficient d m , which depends on jet definition, is the aforementioned anomalous dimen- 
sion and it is given by 

d m = J T (a m (d) - -R*) , (4.10) 
In a similar manner, one can compute fluctuations of mass areas defined as 

(O = (al) ~ (a m ) 2 = <r 2 m (0) + (&a 2 m) - (Aa m ) 2 ^ (Aa 2 m ) , (4.11) 

where we have dropped cr^(0), which is identically zero, and (Aa m ) 2 as it gives higher order 
corrections in a s . A calculation similar to the above leads to the results identical to those given 
in Eq. fj4.9|) with just d m replaced by s m where the latter is defined as 

r 2R jn _ 

sl = I y(« m W-|i? 4 ) 2 . (4.12) 

The analytic results for the coefficients d m and s m , normalised to R 4 , for all four algorithms 
are given in table [2j There, we also quote their approximate numerical values normalised to 
the 1-particle passive mass area. One notices that the coefficients d m depend strongly on jet 
algorithm. The largest value is found for the kt algorithm. The next in the hierarchy is the 
C/A algorithm with its d m coefficient already more than factor four smaller of that from kt- 
SISCone produce fairly small and negative result whereas anti-A;t yields identically zero. The 
observed hierarchy is consistent with the behaviour of passive mass areas of strongly ordered 
system (i.e. z = 0) from Figs. [6] and [71 The large coefficient for the kt algorithm comes about due 
to strong rise of the passive mass area in the region of small interparticle separations enhanced 
in the integral (|4.10p . The smaller d m from C/A is related to the fact that the mass area in 
this algorithm becomes significantly different from the 1-particle result at x > 1/2 hence in the 
range which is less favoured by (|4.10p . Similarly, the small and negative d m from SISCone comes 
from the fact that the mass area in this algorithm deviates from 7ri? 4 /2 only for x > 1 where it 
becomes lower than the mass area of a 1-particle jet. One practical conclusion from table is 
that the passive mass areas of jets from kt algorithm will depend much more strongly on those 
jets' pt than will the passive mass areas of other algorithms. This is a similar conclusion to 
that found in [3] for areas of jets. The values of s m coefficients from table [2] suggest significant 
fluctuations of the passive mass areas of QCD jets. Here the pattern essentially follows that of 
d m coefficients with, however, a somewhat smaller difference between kt and C/A algorithms. 
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4.3 Active mass area 



We define the active mass area as follows 

A m (J)= lim ( J -f\ — ) , (4.13) 

v { gi }^°° \ V{ gi } (Ptg) PtJ{ 9i } / g 

where mj is a mass of the pure jet J and m j{ gi } is a mass of the jet consisting of J and a 
dense coverage of ghosts from some random ensemble {gi}- Similarly, Ptj{ 9i } is a transverse 
momentum of the whole jet with real and ghost particles. The ghosts have density V{ gi ] and 
the infinitesimally small average transverse momentum {ptg)- The limit of infinite density of 
ghosts is taken and, in addition, the result is averaged over many sets of ghosts. The standard 
deviation of the distribution across these ghost ensembles is given by 

T? m (J) = lim (A 2 m (J\{ gi })) -A 2 m (J). (4.14) 

Consider the system with one or more particles whose transverse momenta are well above 
the ghost scale (ptg)- In the case in which such particles are massless 

m J{ 9 ,} ~ m J = 2u im} (Pt 9 )rt {9i} M J \{9i}) ~ ^ } (pt 9 ) 2 ^(J|{ft})^(J|{ 5i }) , (4.15) 

where we used the definition of 4-vector active area A^( J\{g{\) from Eq. l\2A\) . Note also that 
Pjig.\ is a 4-momentum of the whole jet consisting of physical and ghost particles. The two 
terms on the right hand side are of two fundamentally different scales. The second term is itself 
an interesting characteristic of a jet and, as we shall see in Section T5.21 there are cases in which it 
is useful to know it. However, because of an extra power of an arbitrary small ghost transverse 
momentum, (ptg), the contribution of this second term to the mass area, as defined in Eq. (|4. 13[) 
is negligible and that is why we drop it here. This, together with combining Eqs. (|4.13p and 
(|4.15p . leads to the following formula for the active mass area 

^(physical jet J) = — PjA^J) , (4.16) 
Ptj 

which is particularly convenient to work with. In what follows, we will be computing active mass 
areas of jets using the above equation with the 4-vector area A^(J) calculated with FastJet. 
For definition of the latter quantity we refer to Section 12.21 the original paper [3] or FastJet 
documentation [51] . 



4.3.1 Active mass area for 1-particle jet 

The kt and Cambridge/ Aachen algorithms allow only for numerical study of active mass 
areas. The formula (|4.16p can be applied directly for 1-particle jet. The distributions of active 
mass areas from the two algorithms, normalised to ttR /2, are shown in Fig. [S] (left). The results 
from kt and C/A are very close to each other. Similarly to the case of the jet areas [3J, the 
maxima of the distributions lie significantly below 1. The corresponding results for the average 
mass areas and their standard derivations are given in table [3j The 1-particle active mass area 
is very close to the 1-particle passive mass area but if fluctuates significantly across the ghost 
ensembles. This is partly different from the jet area case where the values of active areas were 
consistently 20% below those of passive areas for both algorithms [3] (cf. table [[]). Qualitatively, 
however, the results shown in Fig. [8] (left) and in the first two rows of table [3] are similar to 
those found in [31 for jet areas. 
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Figure 8: Distribution of active mass area A m of 1-particle jets from the kt and C/A algorithm 
(left) and from SISCone and anti-Zc^ (right). The curves correspond to numerical results obtained 
from FastJet. The width of SISCone and anti-fcj distributions arises solely due to finite binning. 



algorithm 



1-particle-jet 



A m /(7rR 4 /2) £ m /(^R 4 /2) 



C/A 
SISCone 
anti-fci 



1.05 
1.02 
1/16 
1 



0.65 
0.61 







Table 3: Average active mass areas for 1-particle jet together with corresponding standard 
deviations for four algorithms. The numbers for kt and C/A correspond to the distributions 
from Fig. [8] (left) whereas for SISCone and anti-fcj analytic results area given. The latter are 
confirmed by the numerical study as shown in Fig. [8] (right). 



The SISCone and anti-fc^ algorithms allow for analytic study of the mass areas of 1- 
particle jets. As pointed out in [3], the split-merge procedure used in SISCone always results in 
the split between two stable cones both if one of them does or does not contain a hard particle. 
This, in turn, reduces the radius of the hard jet by the factor 1/2 and therefore, from (j4.4f) . 
the active mass area by the factor 1/16. This result does not depend on the ghost ensemble, 
assuming that the coverage of ghosts is sufficiently dense, hence the fluctuations vanish. 

The anti-fcj algorithm leads to 1-particle jets of a circular shape with radius R. Therefore, 
the active mass area of such jets coincides with the passive mass area result ()4.4p and the 
fluctuations of the active mass area are identically zero. 

These analytic results are summarised in table [3l The corresponding distributions from 
numerical study are shown in in Fig. [8] (right). We see that both algorithms give distribution of 
active mass areas for 1-particle jets which are close to (5-function. The width comes solely from 
finite binning. 

4.3.2 Active mass areas for general case of 2-particle system 

The results for active mass area of the hardest jet in a system with two particles of arbitrary 
relative hardness are given in Fig. [9l As before, we present the active mass areas normalised 
to 7ri? 4 /2 as functions of the separation (in units of R) between the two particles. All curves 
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Figure 9: Active mass area of the hardest jet for the system of two particles with arbitrary 
ratio of transverse momenta. The results from four jet algorithms are shown as functions of the 
interparticle distance x for several values of the asymmetry parameter z. 



correspond to numerical computations with Fast Jet. One has to keep in mind that in general, as 
was the case for 1-particle mass areas, the 2-particle mass area is a distribution and the curves 
shown in Fig. [9] corresponds to its mean value. 

The active mass area from the kt algorithm does not depend on the asymmetry parameter z, 
just as the passive area. For C/A this dependence is weak. On the other hand, similarly to the 
case of passive mass areas, the active mass areas of SISCone and anti-fcf strongly vary depending 
on whether the two constituent particles are of comparable hardness or whether their transverse 
momenta are significantly different. 

The active mass areas from the sequential recombination algorithms are virtually, for kt and 
C/A, or exactly, for anti-fej, identical with their passive mass area counterparts. Regarding only 
the shape, the situation was quite similar for the 2-particle areas from kt and C/A, only that 
there the normalisation of the active area was different. Since, as seen from the first two rows 
of table O the active and passive 1-particle areas are almost identical for kt and C/A, also the 
results from Fig. [9] and Figs. [6] and [7] coincide to large extent for those algorithms. The identity 
of passive and active mass areas for anti-fet, has the same origin as the analogous identity for the 
areas observed in section [3] and it comes from the fact that the ghosts cluster among themselves 
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after all clusterings with physical particles have occurred. Altogether, for the active mas areas 
from the sequential recombination algorithms, one observes the same pattern in the relation of 
these results to the active areas as seen earlier for the passive quantities. Specifically, while the 
qualitative picture for the active areas and active mass areas is very similar, quantitatively the 
effects seen for the latter are much stronger. 

The case of SISCone is quite special. Firstly in that its 1-particle active mass area gets 
modified very strongly by the presence of the second particle of comparable hardness. The 
similar conclusion was drawn already for the passive mass areas (cf. Fig. [7]). However, in 
absolute terms, the active mass area of a 2-particle SISCone jet remains still much smaller than 
both its passive counterpart as well as the active mass areas from all the other algorithms shown 
in Fig. [9) This implies that the sensitivity of the jet mass to soft background should be the 
lowest for SISCone. This, in turn, translates into a particularly good mass resolution of this 
algorithm seen e.g. in [6]. 

The mechanism responsible for the strong relative change of the the 2-particle jet active mass 
area with respect to the 1-particle case for SISCone is the same as discussed in section 13.11 for 
the passive area and refers to the existence of the third stable cone containing the two physical 
particles. This cone disappears at x = 1/(1 — z) and two particles separated by greater distance 
form two distinct jets and hence the drop of the active mass area seen in Fig. [9] (bottom left). 
If the value of z is sufficiently large and the drop occurs at x ~ 2, the active area of a 2-particle 
jet from SISCone starts falling at some x, an effect of split-merge procedure involving the third 
stable cone (with particles p\ and P2) and the pure ghost cones. This is seen in Fig. [9] (bottom 
left) for the 2-particle system with z = 1/2. A similar effect was discussed in section ET21 for the 
active area. 

For all algorithms the value of 1-particle passive area from table [3] is recovered at x = 0. 
However, at x = 2 only SISCone and anti-fc t yield irR 4 /2 and a larger value is given by kt and 
C/A. As mentioned in section T3.21 this comes from the fact that those algorithms build jets 
starting from formation of local structures. 

A general comment concerning the results of Fig. [9] is that sensitivity of the active mass area 
to the relative hardness of the constituent particles (reflected in the value of z) is related to the 
shape of jets produced by a given algorithm. Namely, the algorithms which depend strongly on 
z are those belonging to the class of "conical algorithms", i.e. SISCone and anti-fct. Though, as 
noticed earlier, in general, their jets are not ideally conical, still their shapes in the (y, 4>) are 
usually quite regular. Conversely, the kt and C/A algorithms, whose jets are highly irregular in 
shape show either none or weak dependence on z. 

The whole variety of behaviours of the active mass areas observed in Fig. [91 depending on 
the algorithm, asymmetry parameter z or the interparticle distance x encourages one to exploit 
it on the analysis-by-analysis basis. 

4.3.3 Scaling violation of active mass area of QCD jets 

We conclude this section by the study of average leading effect of perturbative radiation on the 
active mass areas of QCD jets. In analogy to the passive mass area, we define 

(Am) = A m (0) + (AA m ) = A m (1-particle-jet) + (AA m ) , (4.17) 

where ^4 m (l-particle-jet) depends on jet algorithm as summarised in table The perturbative 
correction to the 1-particle-jet result can be computed from the formula analogous to Eq. (j4.7|) 
with a m replaced by A m and the upper limit 2R removed. The latter is related to the fact that 
the active mass area of a 2-particle jet may in general be different than A m { 1-particle-jet) for 
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algorithm 


2D m 

TTi? 4 


irR 4 


h 


1.694 


1.415 


C/A 


0.387 


0.781 


SISCone 


0.086 


0.101 


anti-A^ 









Table 4: Coefficients governing the logarithmic scaling violation of active mass areas with trans- 
verse momentum of a jet for 2-particle QCD jets, normalised to the passive mass area of a 
1-particle jet. The numerical results obtained by performing interactions from Eqs. (|4.18p and 
(|4.22p using the functions corresponding to those shown in Fig. [9) 



A12 > 2R, as noted in the previous subsection. Simple integration gives 

(AA m )=D m ^-hx "'/iH , An= / ^r(A m (9) - A m (0)) , (4.18) 

7t6 a s (Rpti) J 6 

and the analogous fixed-coupling result as in section 14.2.21 

As discussed at the beginning of section 14.31 the active mass area comes with an intrinsic 
fluctuations due to fluctuations of ghosts. Therefore, the fluctuations of the active mass area of 
QCD jets can be separated into two components 

(Y? m ) = 5^(0) + <A£^) , (4.19) 

where the first one, being just the contribution from one-particle jets, is given for each of the 
four algorithms in table [3l The second term comes from 2-particle configurations. It acquires 
contributions both from the change of the mass area caused by the perturbative radiation (as 
in the passive case) and from the fluctuations of ghosts used to determine the mass area of 
those configurations (absent in the passive case). The corresponding formulae for active areas 
was derived in [3]. A straightforward, analogous derivation leads to the following result for 
logarithmically enhanced, 0{a s ) contribution to the active mass area 

(ASL)-^ln^) , (4.20) 
7t6 a s (Rpti) 



Sl = jjj [(A m (9) - A m (U)f + Z 2 m (6) - $4(0)] (4.21) 
= J f(Al(9) - A 2 m (0)) - 2A m (0)D m . (4.22) 



The results for the coefficients D m and S m , normalised to ttRt/2, are given in tabled! The 
numbers come from integration of the numerical results for active mass areas in the case of kt 
and C/A algorithm and the analytic results in the case of SISCone and anti-fct. One notices that 
the anomalous dimension and its fluctuations for active mass areas are very close to those found 
for passive mass areas with an exception of SISCone whose active area anomalous dimension, 
though of similar absolute magnitude, comes with an opposite sign. Therefore, most of the 
discussion from section 14.2.21 related to table remains valid also for the results from table HI 
The only qualitative difference of the positive D m versus negative d m for SISCone comes from 
the fact that for the former the active mass area of the hardest jet in the 2-particle system never 
goes below the 1-particle result (see appendix IB"|) . 
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5 Illustration with Monte Carlo events 



5.1 Mass areas of simulated jets 

Real jets are of course more complex than just the 1- or 2-particle systems that we studied so 
far. Nevertheless, we believe that a series of features of mass areas from those found in the 
preceding sections will be present also in jets measured in the real life. To provide some support 
to that statement, in this section, we perform a brief study of mass areas of jets from Monte 
Carlo (MC) simulation. Compare to the 1- or 2-particle jets discussed earlier, the simulated jets 
will have more accurate modelling of QCD radiation, in particular that associated with parton 
shower, as well as hadronization. 

We will examine jets from Pythia 6.4 [37J dijets events with the underlying event switched 
off. As before, we will be interested in the hardest jet in an event. We will not, however, impose 
any rapidity or transverse momentum cuts. Our aim will be to obtain an analog of Fig. [U] 
for the MC jets. In the case of two particle system each event was characterized by the the 
asymmetry parameter and by the angular distance between the particles, defined respectively 
in Eqs. (|3.ip and (|3.2p . Those particles were meant as an approximation to two subjets of a 
realistic jet. The meaningful subject analysis of real jets is, however, possible only for some jet 
algorithms. Moreover, it is not very useful in the region of x > 1. Therefore, for the purpose of 
this Monte Carlo study, we need to use slightly more sophisticated strategy. It will be based on 
the procedure from [2"ll28| were it was employed to study the reach of jet algorithms. It involves 
using a "reference algorithm" for which choose C/A with R=1.2. First, an event is clustered 
with this algorithm and the hardest jet is decomposed into its two main subjets Si and 52- 
Those subjets are used to determine x and z. Then, the same event is clustered with one of the 
four "test algorithms", kt, C/A, anti-kt or SISCone with R=0.6. Subsequently, one looks for the 
hardest "test jet" which belongs to the same hemisphere as the hardest jet from the reference 
algorithm. The mass area of this jet is assigned to the (x, z) pair determined in the first step. 
If the separation x between the two subjets, S\ and S2, is small, they will predominantly both 
end up in the hardest test jet. If, on the other hand, this separation is large, only one of them, 
either S\ or 52, will have significant overlap with the test jet. Each of the two situations should 
be reflected in the value of the mass area of the test jet. 

The results obtained after applying the above procedure to the dijet events from Pythia are 
shown in Fig. [TUl where the mass areas are presented as functions of x in bins of the asymmetry 
parameter z. Assigning correct substructure is difficult for the cases with large asymmetries, 
and that is why we do not go below z = 0.1. Otherwise, as shown in [2j[28], the method works 
well with perhaps slightly higher uncertainties for x ~ 1 and z being close to its either lower 
(kt) or the upper (anti-A^) limit. 

The first observation from Fig. [TUl is that all four algorithms give results which are in 
qualitative agreement with the 2-particle picture of Fig. [9j The general pattern of the growth 
of mass area with x and then the drop at some point for x > 1 is well reproduced. Also the 
sensitivity to the z value, low for kt and C/A, noticeable for anti-fej and large for SISCone, 
is consistent with the 2-particle picture. There are, however, quantitative differences between 
Figs. [9] and [lOj They are clearly related to the extra amount of perturbative radiation which 
builds up the structure of physical jets. 

Let us begin with the three sequential algorithms, kt, C/A and anti-fcj. In the 2-particle case 
the active mass areas were in the same ballpark. As seen from Fig. [TUl for MC jets, the three 
algorithms exhibit clear hierarchy with the mass areas from kt being on average significantly 
higher than those from C/A, which in turn are much larger than the mass areas from the anti-kt . 
This can be understood by noticing that the above hierarchy is consistent with the one found 
in section 14.3.31 for the scaling violation coefficients D m (table 0]). The large coefficient for 
the kt algorithm means that even collinear emissions can lead to a significant increase of mass 
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Figure 10: Active mass area of the hardest jet from Pythia dijet events with the underlying 
event switched off. The results from four jet algorithms are shown as functions of the distance x 
between two main subjets for several bins of the asymmetry parameter z. The procedure used 
to determine x and z is described in the main text. 



area. For realistic MC jets multiple such emissions are provided by parton shower. This also 
explains somewhat smaller, but still significant difference in mass areas between the 2-particle 
and MC jets from C/A and a very small difference in the case of anti-fcj, whose scaling violation 
coefficient is identically zero. Another quantitative difference is related to the value of the mass 
area for x > 1. In the 2-particle system this value was close to the passive area of a 1-particle 
jet. For MC jets, though there is a significant drop above x = 1, which we interpret as the 
two subjets not being merged, the mass area in that region is not necessarily close to the 1- 
particle mass area. The latter is related to the fact that the two widely separated subjets, Si 
and S2, with x > 1 have enough room to develop their own substructure and hence cannot be 
approximated by a single particle. The extent to which their mass area differs from 7ri? 4 /2 is 
again related to the scaling violation coefficient and the same hierarchy is observed. We have 
also checked that the values of mass areas for x = 2 are indeed very close to the average mass 
area of the hardest jet in the system. 

The case of SISCone algorithm is special because of its highly nontrivial dynamics involving 
the split-merge procedure and therefore it should be discussed separately. As already mentioned, 
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the general pattern found for MC jets is the same as the one from 2-particle results. In particular, 
we see that the SISCone jets with finite z often have very large mass areas even for x > 1, which 
would point out to the interpretation that their sub jets are likely merged in this region. We 
note also that this observation is compatible with the study of reach of the SISCone algorithm 
from [21[28] and in particular with the discussion therein related to the i? S ep parameter. As in 
the case of kt and C/A, also the SISCone jets from Pythia exhibit somewhat larger mass areas 
than the 2-particle jets. Part of the reason is again some sensitivity to additional radiation from 
parton shower, though that must be moderate given that fact that the D m coefficient in tabled] 
is not very big. Another important mechanism which leads to larger mass areas of the SISCone 
MC jets is related to the split-merge procedure. As discussed in section 14.3-H the small value of 
the mass area of 1-particle jet arises due to the fact that the cone around that particle overlaps 
with cones of pure ghost jets and since there is no other particle that they could share such 
overlapping cones always split. This must be somewhat different in the realistic event which 
is populated with many physical particles. As a consequence, the number of pure ghost jets 
is greatly reduced and therefore the above mechanism, which led to reduction of mass area of 
the 1-particle jet, is not that efficient here. Similar conclusion can be drawn from the study 
of the areas of realistic jets from [3]. To further test this reasoning we varied the split-merge 
parameter / and observed that lower value of this parameter (corresponding to easier merging 
of overlapping cones) leads to larger average value of mass area of the hardest jet and vice versa. 

The mass areas of realistic jets from Monte Carlo simulations deserve detailed study. In this 
section we gave a brief illustration of what sort of effects one may expect if one goes beyond the 
2-particle approximation of a jet. The main conclusion from our MC study is that the 2-particle 
results for x and z dependence, highlighted in Fig. O together with the study of sensitivity 
of the algorithms to the perturbative radiation, allow one to explain most of the features of 
mass areas of the simulated jets. Fig. [TU] provides additional guidance for the choice of the 
jet algorithm which minimizes background contamination. Consistently with the results of the 
2-particle study from preceding sections, it points at SISCone and anti-fcf disfavouring, in that 
particular respect, the kt and C/A algorithms. 



5.2 Correcting jet mass for pileup contamination 

The definition of the active mass area from Section POl suggests its practical application. Suppose 
that instead of ghosts we have in our event a dense set of soft particles distributed fairly uniformly 
in rapidity and azimuthal angle. Such particles may be coming, for instance, from pileup (PU). 
They will normally be clustered together with genuine jet particles leading to contamination of 
the jet. This contamination, in turn, will cause a systematic shift of a mass of the jet 

m %v = m J + Sm2 > C 5 - 1 ) 

where raj pu is a mass of the jet J from an event with pileup whereas m?j is a mass of the same 
jet from an event without pileup. 

Consider an event with the average density of the transverse momentum of PU particles per 
unit area equal to p. Then, in the case of massless hadrons, the magnitude of the shift of the 
mass of jet Jpu due to pileup is given by 

5m 2 (J PlJ ) = p tJpv p A m (J PU ) - p 2 Al(J PV ) . (5.2) 

As we see, the leading correction in p comes from a term involving active mass area. It is 
important to notice that the mass area is computed from Eq. (|4.16p using the uncorrected jet 
Jpu containing both genuine jet particles and the contamination from pileup. That means that 
^4m(<^pu) itself acquires a subleading contribution 0(p/ptj PV ), which in turn implies that the 
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Figure 11: Average mass of the hardest jet in dijet events at the LHC at y/s = 7 TeV (left) and 
14 TeV (right). The hard event simulated with Pythia 6.4 with the underlying switched off. 
Pileup from Pythia 8, tune 4C. Massless hadrons. Jets found with anti-A^, R=0.7. Only events 
with ^hardest jet > 150 GeV accepted, p used to correct for pileup contamination determined 
on the event-by-event basis in the range \y\ < 4 with the area/median method from |21j taking 
the C/A algorithm with R = 0.5. 



first term from Eq. (|5.2p contains an implicit component 0(p 2 ). As long as p is not too large 
compare to ptj PV the presence of PU particles in the jet does not affect very much the value of 
A m . However, when p is comparable with transverse momenta of genuine jet particles, then the 
active mass area computed from (|4.16|) gets a systematic shift towards larger values. 

Qualitatively, a contribution to mj PV proportional to p 2 is needed since it accounts for the 
fact that the change of mass of a jet due to pileup comes not only from clustering pileup particles 
with jet particles but also from clustering pileup particles among themselves. The latter is a 
subleading effect in powers of p. Quantitatively, to get the mass correction which is valid also 
at large p one needs a second, negative term in Eq. (|5.2p . which combined with the 0(p 2 ) 
component from the first term gives the full subleading contribution. 

We have studied the effects of mass shift due to pileup using dijet events from Pythia 6.4 [37] 
combined with pileup from Pythia 8 |38| . H All hadrons were passed through a simple calorimeter 
with cells in the (y,4>) plane of size 0.1 and rapidity coverage |y| < 4.5. Then, the calorimeter 
towers, which correspond to massless 4-vectors, were used as input to clustering algorithms. 
Jets were found with the anti-kt algorithm with R = 0.7 and only events with pt of the hardest 
jet greater than 150 GeV were accepted. 

To correct jet masses for pileup contamination one needs to know the level of the pileup 
transverse momentum per unit area, p, for each event and then apply the formula (|5.2p . To 
determine p, we used the area/median method proposed in [21] and implemented in the Fast Jet 
package [31]. The method measures p on the event-by-event basis. It starts by adding ghost 
particles to an event. Then, all particles (physical and ghosts) are clustered with an infrared 
and collinear safe jet algorithm. That leads to a set of jets, {j}, with jets ranging from hard 
to very soft. That set is used to determine p, which is defined as a median of the distributions 
of {ptj/Aj}, where ptj is the transverse momentum of the jet j and Aj is its scalar area. Using 
the median is a way to dynamically separate the soft and hard parts of an event. The method 

3 We would like to thank Gavin Salam for suggesting using this example to illustrate the procedure of jet mass 
correction. 
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leaves some freedom in the choice of the jet algorithm, the rapidity range in which p is measured 
as well as treatment of the hardest jets. Following suggestions from the literature |20|I21|. we 
used the C/A algorithm with R = 0.5 and the active area definition. Then, the median was 
determined taking all jets in the range \y\ < 4. As shown in |20U21j . for large rapidity range, 
the influence of the two hardest jets on the value of p is very small, therefore we did not remove 
them from the set of jets used for p determination. 

One has to remember that p characterises pileup in a given event only on average and that 
there are always point-to-point fluctuations. Therefore, it may happen, especially for light jets, 
that our procedure occasionally leads to negative values of m 2 . This corresponds to the cases 
with negative fluctuation in which the actual contamination from pileup to our jet is locally 
smaller than the typical level of PU in that event represented by the value of p. In such events, 
our procedure subtracts too much from a jet. There is, however, a second class of events with 
positive point-to-point fluctuation in the vicinity of a jet and for those events the correction 
based on global p is slightly underestimated. The above errors of under or overestimating the 
mass correction will cancel in quantities averaged over many events like (rrij) or (mj). To 



make sure that this happens for the latter observable, for events with m 2 , < 0, one needs to set 



In Fig. \TT\ we show the average mass of the hardest jet as a function of the number of 
pileup, i.e. additional min-bias events accompanying the production of hard dijet. The plots 
correspond to the LHC at y/s = 7 TeV (left) and 14 TeV (right). We see that the average mass 
of the hardest jet grows, approximately linearly with n. That is easily understood with our 
formula (|5.2p in which the main contribution comes from the term linear in p and it is natural 
to expect that p of pileup will scale linearly with n, which is just the number of alike min-bias 
events. We see also that the effect of pileup contamination is strong reaching up to 70% shift 
in the mass for the cases with high n. For reference, we show a horizontal line corresponding to 
the case without pileup (n = 0). Then we apply the "mass area correction" using the first term 
from Eq. (|5.2|) as well as the full correction from that equation involving both mass area and the 
p 2 term. We see that the mass area term dominates the mass shift and the correction involving 
this term alone works very well up to fairly high pileup, n ~ 12 — 15. For larger n, however, 
the second term, subleading in p, becomes necessary to get a decent value of corrected jet mass. 
We note that using both terms is equivalent to directly correcting the 4- vector of a jet with the 
help of the 4- vector area and then calculating its mass [39] . Our study from this section gives 
however a significant additional insight into the structure of these corrections. In particular, we 
have gained an understanding which contributions to the mass shift are dominant and why. 

Overall, the example from this section shows that most of the contamination of the jet mass 
due to pileup comes from the term involving mass area. On one hand, that confirms that the 
mass area is a robust characteristic of the susceptibility of the jet mass to contamination. On 
the other hand it provides a simple method to correct for that contamination and recover, with 
excellent accuracy, the value of the original jet mass. More studies of jet mass corrections, in 
particular a systematic analysis and optimisation along the lines of [3"ll21 [ l4"0" l l41| , is left for future 
work. 

6 Conclusions 

We have proposed a new characteristic of a jet, called mass area, which is supposed to measure 
the susceptibility of the jet's mass to soft background like pileup or underlying event. It is a 
close relative of the catchment area of jets introduced in the work of [3"ll21|. Two complementary 
definitions of the mass area were given suitable for two different limits of the distribution of 
UE/PU: the passive mass area, measuring the sensitivity of mass of a jet to contamination from 
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pointlike radiation, and the active mass area, more appropriate if the soft background radiation 
is diffuse and uniform. 

We have investigated the properties of the passive and active mass areas for four jet clustering 
algorithms, kt, Cambridge/ Aachen, anti-fct and SISCone by studying systems with one or two 
particles of arbitrary hardness. We have also confronted the above results with those obtained 
with more realistic jets simulated by Pythia. 

As a preparatory step, we have generalised in section [3] the results for passive and active 
areas of 2-particle jets to the case where the two constituent particles have arbitrary transverse 
momenta. This part of our study shows that even the "conical algorithms" , SISCone and anti- 
kt, rarely produce jets whose shapes in the (y,4>) plane are circular. As discussed in section [3] 
and illustrated e.g. in Fig. [21 a very simple system of two particles with comparable hardness 
leads to the whole variety of jet areas depending on the algorithm, asymmetry parameter z or 
the distance between the particles x. 

The study of mass areas of 1- and 2-particle jets, presented in section 21 reveals that similar 
richness exists also for this characteristic of a jet. A general pattern which is seen is that 
the "conical algorithms", SISCone and anti-fct, exhibit strong dependence on the asymmetry 
parameter z measuring how much of the total jet's transverse momentum is taken by the softer 
particle. On the contrary, the kt and C/A algorithms, with jets of highly irregular areas, show 
virtually no dependence on z. 

The dependence on the distance between the two constituent particles in the (y, 4>) plane, 
xR, is substantial for all algorithms though its character varies across them. The results from 
kt grow monotonically for x < 1 whereas for C/A and anti-^ they start differing from the 1- 
particle result for much larger x in the ballpark of x > 0.5. Finally, SISCone shows a completely 
different x-dependence which is the largest for x > 1, though the active mass areas changes 
significantly with x also for x < 1. 

In absolute terms, the active areas and mass areas of 2-particle jets from SISCone are much 
smaller than from the other three algorithms. It is related to the split-merge procedure, which is 
a part of the SISCone algorithm, and which results in splittings of stable cones with perturbative 
particles overlapping with cones containing only ghosts. The low active area and mass area of 
SISCone means that the jets from this algorithm will be, on average, much less contaminated 
by a soft and diffuse background than the jets from kt, C/A or anti-A^. This, in turn, will result 
in a very good pt and mass resolution. 

Our study of active mass areas of jets from MC simulation showed the same pattern of x 
and z dependence as that found for 1- and 2-particle jets. This, together with the results from 
the study of sensitivity of mass areas from different algorithms to perturbative radiation, was 
sufficient to account for most of the features of mass areas of the simulated jets. We used the 
simulated jets also to study corrections to jet masses due to contamination from pileup. We 
found that most of the systematic shift in mass caused by soft background comes in the form 
of a term involving mass area. That confirms that the mass area indeed captures the essential 
aspects of the modification of jet mass and provides simple method to correct for it and recover 
its original value. 

As for the the comparison between the areas and the mass areas of jets, we have seen that, 
qualitatively, the two characteristics exhibit similar behaviour. Quantitatively however, the 
effects observed for mass areas are always significantly bigger, a fact that we associate with an 
additional power of angle in the definitions of the mass areas with respect to the areas. 

We envisage several ways in which the concept of mass area, introduced in this paper, could 
be used. Firstly, as a measure of the susceptibility of the jet's mass to contamination from soft 
radiation, it provides a guidance for choosing a given jet algorithm for a given purpose. For 
example, to minimise the systematic error in determination of mass of a jet one may choose the 
algorithms whose mass area is smallest. Secondly, knowing how the mass area depends on the 
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relative hardness of the subjets may help in designing a discriminating variable which in turn 
could allow one to separate the QCD jets from the jets coming from decays of heavy objects. 
In this context, one could consider, for instance, using the mass area as an additional variable 
entering the Boosted Decision Tree [42 j . It would be also very interesting to further study the 
corrections of jet masses for the contamination from soft background. Possible extensions of the 
analysis from section 15.21 could involve an optimization of jet definition as well as correcting for 
contamination from underlying event. Finally, it would be worth investigating the effects the 
new procedures of noise reduction, namely filtering [6], pruning [llj and trimming [12j . have on 
the mass area of jets. We believe that all the above possibilities are worth investigating with 
jet events from Monte Carlo simulations. We leave these questions for future work. 
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A Passive areas and mass areas for general 2-particle system 

In this appendix, we collect all analytic results for passive areas and mass areas of the hardest 
jet in the system with two particles of arbitrary transverse momenta. The calculations were 
done within the E recombination scheme and in the limit of small R. The latter corresponds 
to neglecting the difference between y and eft dimensions and was used consistently throughout 
the paper for both passive and active quantities. All the analytic results presented here were 
confirmed for a range of values of z by numerical study with Fast Jet. 

As discussed in sections [3j the area and the mass area of the hardest jet in the system with 
two particles depends on the interparticle distance in the (y, <f>) plane. For each algorithm, one 
distinguishes several subranges of the range < x < 2. In each such subrange the x-dependence 
of the area and mass area is described by a different function. 

Table [5] summarises the results for passive area and mass area of the hardest jet from four 
algorithms normalised to the 1-particle result, i.e. irR 2 or 7ri? 4 /2, respectively. The critical 
x values, x c i,..,4, were defined in Eqs. (|3.6p . (j3.7|) . (|3,9p and (j3. 13[) . 

The functions from table O different for areas and mass areas are defined below. To make 
the notation more concise we define the auxiliary function 

f(x, x') = y/(\ - (x - x') 2 ) ((x + x' f - 1) , (A.l) 

(A.2) 

and the two following variables 

z 

z = ^ , x = zx , (A. 3) 

where x is the interparticle distance in units of the jet radius, defined in Eq. (|3.2p . and z is the 
asymmetry parameter from Eq. (|3.ip . 
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passive area (or mass area) normalised to 1-particle result 


x range 


h 


C/A 


SISCone 


&nti-k t 


[0, x c i] 


9x(x) 


Px(x) 


Px(x) 


w~(x) 


[x c l,x C 2] 


Qx(x) 


[Xc2,X c3 ] 


r x (x) 


u x (x) 


[X C 3, 1] 


v x (x) 


[l,x ci \ 


h x {x) 


h x (x) 


tx{x) 


w x (x) 


[x c4 ,2] 


h x (x) 


1 



Table 5: Passive areas and mass areas of the hardest jet in a system with two particles of 
arbitrary hardness. The critical values of x, x c i t ... t 4, are defined in Eqs. (j3.6fl . (j3.7|) . (|3.9p and 
(|3,13p . The functions, corresponding to given range of x are given in subsections lA.il for areas, 
and lA.2l for mass areas. The subscript in function name, X = a, ma, refers to the corresponding 
quantity. To obtain the area or mass area in a given range of x, the appropriate function from 
the table should be multiplied by the 1-particle result, nR 2 or 7ri? 4 /2, respectively. 



A.l Areas 

The passive areas of 2-particle jets are fully specified for the four algorithms by the following 
set of functions to be used with table [5] 



1 



g a (x) = -{x\ 1- — + 2 



7T 



;t - a.rccos ( — 



(A.4) 



h a{x) = -9a{x) , 
Pa{x) = 1 , 

( s 1 / . f(x,Xj) 2 

q a {x,xj) = -< 7T H h x 

7T Z 



(A.5) 
(A.6) 
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7T 
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2xxj 



+ arcsin 



x 2 + (x - xj) 2 - 1 
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f(x,x- Xj) 



arctan 
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2x(x — xj 

x 2 - (x - xj) 2 - 1 
f(x,x- Xj) 
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f(x,xj) 
2 

(A.8) 



7T 



2 arccos 
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v a (x,xj,z,x) 
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arcsm 



1-z 2 
2x 2 z 



v / 4x 2 -(l + x 2 -z 2 ) 2 



(A.12) 



For the kt algorithm, the general 2-particle result is identical with that found in [3] for the 
case with strongly ordered transverse momenta. The results from the other three algorithms 
depend on z via xj (see Eq. (|3.3p ). in the case of C/A and SISCone, and also explicitly in the 
case of anti-Zej. One can also verify that in the limit z — >• 1/2, the function v a reduces to r a and 
the function w a to h a . Therefore, the areas from C/A and anti-fc^ become identical in this limit, 
as seen in Figs. [3] (left) and [31 



A. 2 Mass areas 



Here we gather all the functions needed, together with table to compute the passive mass 
areas of the hardest jet in the 2-particle system for the four algorithms. The definition of the 
passive mass area (j4.3[) introduces extra ^-dependence via the transverse momentum of a jet, 
Ptj. In line with all the other calculations presented in this paper, also here, we have assumed 
that the jet radius is small. This allows one to approximate the full jet's transverse momentum 
by Ptj — Pti + Pt2 The results for the normalised 2-particle mass areas are 



gma(x) = i<j vr (1 + x 2 ) + | (6 + x 2 ) ^1 - y + 2 (1 + x 2 ) arcsin (| 
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7T 
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qma(X,Xj,z) = - < ~ 
7T Z 



+ 



l r 
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(A.13) 

(A.14) 
(A.15) 
5xj(l + x 2 ) 
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(A.16) 



Tma (x, Xj , Z) 



- - 

7T 4xj - 



f(x,xj) 

5x j(l + x 2 ) + Xj — 4x(l + x 2 — x 2 ) + 4zx(l — x 2 — xxj + x 2 ) /(x, x j) 



4 For real jets, however, there will be some difference depending on whether the two constituent particles are 
oriented along the rapidity axis or along the <j> axis. We have performed corresponding numerical study and found 
that this difference is indeed negligible except for a very specific case of large (x ~ 2) and symmetric (z ~ 1/2) 
jets from the SISCone algorithm where it can get up to 20%. 
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As for the areas, also here all results except those from the kt algorithm depend on z, both 
explicitly and via xj. As discussed in section fO| the z-dependence of the mass area of the kt 
jets vanishes due to additional reflection symmetry. Similarly, for anti-fcj, the function v ma goes 
to r ma and w ma to h ma in the limit z — > 1/2. Hence, the C/A and anti-/ct mass areas become 
identical as shown also in Figs. [6] (right) and [7] (right). 
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Figure 12: Active mass area of the harder jet from SISCone in a 2-particle system with strongly 
ordered transverse momenta. The curve corresponds to the analytic result (|B.1|) . At x = and 
x = 2 the 1-particle result, 1/16, is recovered. 



B Active mass areas from SISCone for strongly ordered 2-particle 
system 

The active mass area of the hardest jet in the system of two particles with strongly ordered 
transverse momenta (e.g. from QCD splitting) is given by 
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with the following definitions 
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2-x 

u 2 (x) = 247r(4 ~_ x2)3/2 (10 + 5x 2 - 6x 4 + x 6 ) + ^ (vr - arccos g)) , (B.3) 

u 3 (x) = 48 ^. x 3 |3ttx 3 + 2 a/4 - x 2 (2 + x 2 ) - 3x 3 arccos (|) 1 . (B.4) 

The corresponding curve in shown in Fig [T2l Eq. (jB.ip was used directly, together with 
the definitions (|4.18|) and (|4.22p to obtain the anomalous dimension and the corresponding 
fluctuation coefficient given in table [H 
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